Sketch-to-Text Generation: Toward Contextual, Creative, and Coherent Composition

نویسنده

Yejin Choi

چکیده

The need for natural language generation (NLG) arises in diverse, multimodal contexts: ranging from describing stories captured in a photograph, to instructing how to prepare a dish using a given set of ingredients, and to composing a sonnet for a given topic phrase. One common challenge among these types of NLG tasks is that the generation model often needs to work with relatively loose semantic correspondence between the input prompt and the desired output text. For example, an image caption that appeals to readers may require pragmatic interpretation of the scene beyond the literal content of the image. Similarly, composing a new recipe requires working out detailed how-to instructions that are not directly specified by the given set of ingredient names. In this talk, I will discuss our recent approaches to generating contextual, creative, and coherent text given a relatively lean and noisy input prompt with respect to three NLG tasks: (1) creative image captioning, (2) recipe composition, and (3) sonnet composition. A recurring theme is that our models learn most of the end-to-end mappings between the input and the output directly from data without requiring manual annotations for intermediate meaning representations. I will conclude the talk by discussing the strengths and the limitations of these types of data-driven approaches and point to avenues for future research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Impact of Contextual Clue Selection on Inference

Linguistic information can be conveyed in the form of speech and written text, but it is the content of the message that is ultimately essential for higher-level processes in language comprehension, such as making inferences and associations between text information and knowledge about the world. Linguistically, inference is the shovel that allows receivers to dig meaning out from the text with...

متن کامل

Automated Generation of Graphic Sketches by Example

Hand-crafting effective visual presentations is time-consuming and requires design skills. Here we present a case-based graphic sketch generation algorithm, which uses a database of existing graphic examples (cases) to automatically create a sketch of a presentation for a new user request. As the first case-based learning approach to graphics generation, our work offers three unique contributio...

متن کامل

ROBODANZA: Live Performances of a Creative Dancing Humanoid

The paper describes the artistic performances obtained with a creative system based on a cognitive architecture. The performances are executed by a humanoid robot whose creative behaviour is strongly influenced both by the interaction with human dancers and by internal and external evaluation mechanisms. The complexity of such a task requires the development of robust and fast algorithms in ord...

متن کامل

The role of creative economics in entrepreneurship and revenue generation of public libraries: A systematic review

Purpose: The present study was conducted to identify the status of research on entrepreneurship and income generation in public libraries with a focus on creative economics. Method: The present study is a systematic study. The statistical population of this study was all the researches done in the field of creative economy in public libraries that have been published in connection with entrepr...

متن کامل

Sketch-to-Image Generation Using Deep Contextual Completion

When the input to pix2pix translation [9] is a badly drawn sketch, the output follows the input edges due to the strict alignment imposed by the translation process. In this paper we propose sketch-to-image generation, where the output edges do not necessarily follow the input edges. We address the image generation problem using a novel joint image completion approach, where the sketch provides...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Sketch-to-Text Generation: Toward Contextual, Creative, and Coherent Composition

نویسنده

چکیده

منابع مشابه

The Impact of Contextual Clue Selection on Inference

Automated Generation of Graphic Sketches by Example

ROBODANZA: Live Performances of a Creative Dancing Humanoid

The role of creative economics in entrepreneurship and revenue generation of public libraries: A systematic review

Sketch-to-Image Generation Using Deep Contextual Completion

عنوان ژورنال:

اشتراک گذاری